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(57) ABSTRACT 

A compiler extension for the compile-time verification of 
programmer-defined data dependencies. The compiler 
extension provides for a check statement which takes as an 
argument a data-verification expression. The data- 
verification expression has a predetermined value when the 
programmer-defined data dependency is conformed to in the 
program. A compile-time error is generated if the data- 
verification expression does not have the predetermined 
value on the compilation of the check statement. The suc- 
cessful compilation of the check statement does not result in 
run-time code or run-time data allocation occurring. 

15 Claims, 4 Drawing Sheets 
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COMPILE-TIME DATA DEPENDENCY a data dependency is incorrectly used in a particular piece of 

VERIFICATION programming code which is not executed during testing of 

the program the error may not be uncovered until some time 

BACKGROUND OF THE INVENTION after the program is put into use. Furthermore, the identifi- 

1 T h * 1 F'eld 5 cat * on °^ errors during program testing does not occur at a 

single time; the program must be put through successive 

The present invention is directed to an improvement id tests to uncover each such error. Finally, such error identi- 

compilers and related software development tools and in fiction using, for example, the assert macro does not occur 

particular to the verification of data dependencies in com- unt y run-time. 

puter programs at compile-time. 10 R ^ therefore desirabk tQ have m improved com pil er , 

2. Prior Art assembler, or related tools, capable of verifying the correct 

It is common to utilize data dependencies in computer use of data subject to data dependencies, without introduc- 

programs. Data dependencies are used in computer pro- ing additional run-time costs or increased run-time code, 

gramming because they can provide the advantages of which will identify data dependency errors at a single time, 

memory efficiencies and run-time execution efficiencies. is an d which compiler will not rely on run-time testing of the 

Where data dependencies are created, program code can code to uncover such errors, 

be written which will take advantage of characteristics or ™ Twm _ _ » T 

rules relating to the data structures which are not expressly SUMMARY OF THE INVENTION 

defined in the programming language, or the data type According to one aspect of the present invention, there is 

definition, themselves. Examples of such data dependencies 20 provided an improved compiler. 

include requirements that two different data structures have According to a further aspect of the present invention, 

identical lengths or that two data elements be adjacent to one mcrc ^ provided an improved compiler capable of perform- 

another in memory. The programmer can take advantage of mg comp iie-time verification of data dependencies, 

these known relationships between data structures to create According t0 a further aspect of the present invention, 

programs which execute more quickly or with the need for ^ ^ ided a flcr exteasion a data 

less memory than would otherwise be the case. verification component for compiling a data-dependency 

Good programming practice requires that such assump- expression contained in a computer program, the data- 
dons or rules relating to how the data may be manipulated dependency expression reflecting a programmer-defined 
(the data dependencies) are indicated in the documentation ^ relationship between data in the computer program, the 
accompanying the computer code. However, the error-free data-dependency expression having a one of a first set of 
use of data which is subject to the data dependencies relies predefined values when the program conforms to the 
on the understanding of programmers, rather than on any programmer-defined relationship between data, and the 
restriction imposed by the compiler. Problems may arise data-dependency expression having a one of a second set of 
when modifications are made to the computer program at ^ predefined values different from the first set of predefined 
which time the documentation is not referenced or available, values, when the computer program does not conform to the 
or where the documentation is poorly prepared or inad- user-defined relationship between data, wherein upon com- 
equately laid out. piling the data -dependency expression, the data verification 

Furthermore, in writing complex computer systems, it is component generates no run-time code or run-time data 
often preferable to define data or data types at a particular ^ storage allocation, and upon compiling the data-dependency 
point in a computer program and to make use of the data or expression, the data verification component generates a 
data type at different points in the computer systems or compile-time error message when the value of the data- 
programs which may be remote from the data definition dependency expression is in the second set of predefined 
point. The data dependencies relied upon at the remote point values, and the data verification component generates no 
may not be referenced in documentation at the data defini- 45 compile-time error message when the value of the data- 
tion point. When modifications are later made to the com- dependency expression is in the first set of predefined 
puter program, the data dependencies may not be apparent values. 

to the programmer making the modifications, with the result According to a further aspect of the invention, the com- 

that errors may arise in the program. p jj er extension includes a data verification component which 

Typically, computer programmers use programming lan- 50 further comprises a component for including in the compile- 

guage features which can be utilized to verify such data time error message reference to the data-dependency expres- 

dependencies at run-time. An example of such a computing sion. 

language function is the assert macro included in libraries According to a further aspect of the present invention, 
supplied with many compilers for the C programming mere j s provided a compiler extension for a programming 
language. Such macros or programming language features 55 language compiler, the compiler extension comprising a data 
will evaluate specified characteristics of data structures at verification construct for accepting as an argument a data- 
the point in the execution of the program where the state- dependency expression in a computer program, the data- 
meat or macro is found. dependency expression reflecting a programmer-defined 

The difficulty with such an approach is that additional relationship between data in the computer program, the 

code is introduced into the computer program (which in g 0 data-dependency expression having a one of a first set of 

some cases may be minimized or eliminated by compiler predefined values when the computer program conforms to 

optimization). To eliminate this additional code, it is neces- the programmer-defined relationship between data, and the 

sary to go through the step of removing the verifying macros data-dependency expression having a one of a second set of 

or statements in the program entirely after the testing phase predefined values different from the first set of predefined 

of the program development. es values, when the program does not conform to the user- 

The further difficulty with this prior art approach is that defined relationship between data, wherein upon compila- 

because the verification occurs at run-time, if data subject to tion of the computer program, the programming language 
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compiler generates a compile -time error message for the FIG. 2 is a flow diagram showing the compile-time 

compilation of the data verification construct when the value process flow of the preferred embodiment of the invention; 

of the data-dependency expression is in the second set of and 

predefined values, and no compile-time error message for FIG. 3 is a computer listing with compiler error messages 

compilation of the data verification construct when the value 5 generated by a compiler operating in accordance with the 

of the data-dependency expression is in the first set of preferred embodiment of the invention, 

predefined values. In the drawings, preferred embodiments of the invention 

According to a further aspect of the present invention, the are illustrated by way of example. It is to be expressly 

compiler extension described above is such that the compi- understood that the description and drawings are only for the 

lation of the data verification construct generates no run- to purpose of Olustrauon and as an aid to understanding, are not 

time code intended as a definition of the limits of the invention. 

According to a further aspect of the present invention, the DETAILED DESCRIPTION OF THE 

compiler extension described above is such that the compi- PREFERRED EMBODIMENTS OF THE 

lation of the data verification construct results in no alloca- INVENTION 

lion of run-time data storage. 15 Referring t0 nG. 1 there is illustrated in a block diagram 

According to one aspect of the present invention, there is showing the high level description, only, a structural repre- 

provided an improved method for verifying programmer- sentation of a computer program to be compiled by a 

defined data dependencies at compile-time, comprising the compiler and the compiler extension of the preferred 

steps of 20 embodiment of the present invention. The example com- 

a) defining a check statement in a computer program, the puter program includes a program block 10, a data definition 
check statement having an argument, block 12 in which the data structures which are subject to 

b) inserting the check statement in the computer program data dependencies are defined. Block 14 is a section of 
at points where a data structure having a programmer- program code, or module, including code to verify the data 
defined data dependency is accessed by the computer 25 dependencies and code which, when executed, relies on the 
program, defined data dependencies, exemplified by block 16. Blocks 

c) passing as an argument to check statement, a data- 18 an d 20 represent further sections of computer code also 
dependency expression having a one of a first set of including verification of data dependencies and, in block 20, 
predefined values when the data structure conforms to code wmch relies on » me data dependencies. 

the programmer-defined data-dependency and having a 30 In FIG- 2, a compile-time flow diagram is represented 
one of a second set of predefined values different from showing in schematic terms, the compile-time flow of the 
the first set of predefined values when the data structure compiler process, in accordance with the preferred embodi- 
does not conform to the programmer-defined data- ment of the invention. Block 30 represents initial compile- 
dependency, time processes. Block 32 represents compilation of a veri- 
the check statement having a definition such that the 35 fication macro definition. Block 34 represents the 
check statement will compile successfully where the compilation of data definitions, wherein the data is subject 
value of the data-dependency expression is in the first to data dependencies. Block 36 represents the macro expan- 
se! of predefined values, and unsuccessfully where the SI0n of the verification macro as found in the program code, 
value of the data-dependency expression is in the Block 38 represents the compiler generated error message 
second set of predefined values. 40 resulting from a compiler finding an error in the macro 
According to one aspect of the above method of the expansion. Block 40 represents the compilation of the code 
invention, the successful compilation of the check-statement whlch makcs of *c data dependencies, 
will result in minimal or no additional run-time overhead. 1° FIG. 3, code illustrating an example of the compiler of 
According to one aspect of the above method of the the invention is set out. The compiler improvement of the 
invention, the successful compilation of the check-statement 45 present invention may be implemented as a component of a 
will result in the generation of no additional run-time code. supplied library of macros, or by way of a compiler exten- 
According to one aspect of the above method of the sion built into the compiler program itself . In this description 
invention, the successful compilation of the check-statement of the invention, each of those implementations will be 
will result in no additional run-time data storage allocation. referred to as a compiler extension. When implemented as 
Advantages of the present invention include a compiler or 50 an extension incorporated into the compiler itself, the imple- 
assembler which is capable of identifying errors in the use mentation of the invention will result in the compiler exten- 
of data subject to data dependencies at compile-time. The sion itself generating error messages at compile-time. When 
invention provides the advantage of identifying such errors, implemented as a compiler extension which is discrete from 
without introducing additional run-time data storage costs or the compiler proper, the invention will utilize a data- 
increased run-time code. Further advantages include a com- 55 verification construct which will be compiled in such a way 
piler which provides the opportunity to identify all data- ^ to result in the compiler generating appropriate error 
dependency errors at compile-time, not only the errors messages. The invention may also be implemented as a 
which might be located in code executed during run-time standalone macro definition, as is shown in FIG. 3. 
testing of the program. Those skilled in the art will appreciate the modifications 

60 necessary to implement the present invention using above 

BRIEF DESCRIPTION OF THE DRAWINGS referenced, or other, standard programming methodologies. 

Preferred embodiments of the present invention will now Those skilled in the art will also understand that the present 

be described, by way of example only, with reference to the invention is applicable to both compilers and assemblers, 

accompanying drawings in which: The present invention is described with relation to compilers 

FIG. 1 is a block diagram showing the structure of a 65 but reference to compilers will be taken to include 

computer program written in accordance with the preferred assemblers, where understood to be appropriate by those 

embodiment of the invention; skilled in the art. 
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In this detailed description, a preferred embodiment will As can be seen from FIG. 3, where the expressions which 

be described in which the compiler improvement is imple- are arguments of the "Verify" macro are not true, compile - 

mented by way of a macro definition in the high-level time error message is generated. It may be noted that FIG. 

computer language C++. In the preferred embodiment of the 3 omits the code where the data structure subject to the data 

present invention, implemented by way of a macro 5 dependency is accessed. It will be understood that in a 

definition, a macro is defined which accepts an expression as program in which the Verify macro is used, the Verify macro 

an argument or parameter. The macro definition is capable of will proceed the code accessing the data structure, as indi- 

expansion to a statement to be compiled by the compiler but cated in FIG. 2. 

the statement compiled does not give rise to memory alio- As can be appreciated, other constructs may be used for 

cation at run-time or code which requires execution at 10 the macro definition. As long as such constructs do not result 

run-time. In the example of the preferred embodiment in the generation of run-time memory allocation or execut- 

described in FIG. 3, the macro definition comprises a type able code, the advantage of the invention as illustrated in the 

definition which is an array of a size dependent on the preferred embodiment will be achieved, 

parameter or argument of the macro. Upon compilation of As also indicated above, this approach to the compile- 

the verification macro definition, the compiler creates a 15 time detection of data dependency errors can be achieved by 

compile-time macro which accepts an argument or a param- incorporating the features illustrated in FIG. 3 by way of a 

eter. When the macro is expanded during compilation, the compiler extension which is achieved not through the use of 

value of the parameter expression (the data-verification macro definitions or libraries but by way of coding new 

expression) will give rise to a statement which can be language features into the compiler itself. The implementa- 

compiled successfully or which will create an error message 20 ti on 0 f sucn aD alternative is understood by those skilled in 

in the compiler, depending on the value of the data- the art of compiler construction. The advantage of such an 

verification expression. The macro as used by the program- implementation is that such a compiler may be designed to 

mer may be considered to be a check statement, which produce an error message or alert which is tailored by the 

accepts a data-verification expression, the successful com- compiler to provide information to the programmer relating 

pilation of the check-statement depending on the value of 25 to tne data dependency which has not been observed by the 

the data -verification expression. According to the preferred computer program code which generates the error message, 

embodiment of the invention, the data definitions which are Such an error message may include information relating to 

subject to the data dependencies are compiled in the usual or derived from the data-dependency expression, 

manner, as signified by block 34 in FIG. 2. The program While the mve ntion has been particularly shown and 

code will include portions which manipulate the data in a 30 described with respect to preferred embodiments thereof, it 

manner which relies on the adherence to the data dependen- ^ ^ understood by those skilled in the art that the 

cies by the programmer. Examples of these are shown in foregoing and other changes in form and details may be 

block 16 and 20 in FIG. 1. made therein without departing form the spirit and scope of 

The preferred embodiment of the present invention pro- the invention, 
vides compile-time data dependency verification in the 3S Having thus described my invention, what I claim as new, 
following manner. The macro as defined in block 32 of FIG. an d desire to secure by Letters Patent is: 
2 is included in the program code in the portion of the code \ a compiler extension residing in a computer system 
where the data is to be manipulated pursuant to the data comprising a data verification component for compiling a 
dependency rules. This is illustrated in blocks 14 and 18 of data-dependency expression contained in a computer 
FIG. 1. When the macro is expanded as is represented in 40 program, the data -dependency expression reflecting a 
block 36 of FIG. 2, expressions which define the data programmer-defined relationship between data in the corn- 
dependencies are passed as parameters to the macro. This is p U t er program, the data -dependency expression having a one 
seen in FIG. 3 in lines 24, 26, 31 and 33. These lines of the Q f a first set of predefined values when the program con- 
code show the macro "Verify" taking as arguments the data forms to the programmer-defined relationship between data, 
dependency expressions used in the example of FIG. 3: that 45 an d the data-dependency expression having a one of a 
the size of structure x and structure y are identical and that second set of predefined values different from the first set of 
the data items (x,i) and (xj) are adjacent. In the first predefined values, when the computer program does not 
example, the "Verify" macro is being used as check conform to the user-defined relationship between data, 
statement, taking the expression "sizeof(x)=~sizeof(y)" as wherein upon compiling the data-dependency expression, 
the data dependency expression which expresses the 50 the data verification component generates no run-time code 
programmer-defined relationship between the data. C r run-time data storage allocation, and upon compiling the 

In the example of FIG. 3, the macro expansion where the data-dependency expression, the data verification compo- 
expression passed as a parameter has the value "true" will nent generates a compile-time error message when the value 
proceed with no compile-time error. As a result, a data type of the data-dependency expression is in the second set of 
will be defined. A data type definition has no run-time 55 predefined values, and the data verification component gen- 
overhead in either memory or execution time and therefore erates no compile-time error message when the value of the 
there is no overhead introduced into the resultant execution data-dependency expression is in the first set of predefined 
code. Where, however, the expression passed as an argument values. 

is false, the compiler will identify a compile-time error. 2. The compiler extension of claim 1 in which the data 

Thus, where a data structure is changed in a way which is 60 verification component further comprises a component for 

inconsistent with the defined data dependency, a compile- including in the compile-time error message reference to the 

time error will be generated and the programmer will be data-dependency expression. 

alerted that there is a change to the program which breaks 3. The compiler extension of claim 1 for a compiler of a 
the data dependency, and this alert (or error message) will computer language which supports programmer-defined 
appear at the point in the code where the dependency is 65 data typing and in which the data verification component 
introduced. In either event, it is to be noted from the flow utilizes the compiler data typing for compiling the data- 
chart of FIG. 2, the data dependent code will be compiled. dependency expression. 
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4. A compiler extension for a programming language 
compiler residing in a computer system, the compiler exten- 
sion comprising a data verification construct for accepting as 
an argument a data-dependency expression in a computer 
program, 

the data-dependency expression reflecting a programmer- 
defined relationship between data in the computer 
program, 

the data-dependency expression having a one of a first set 
of predefined values when the computer program con- 
forms to the programmer-defined relationship between 
data, and the data-dependency expression having a one 
of a second set of predefined values different from the 
first set of predefined values, when the program does 
not conform to the user-defined relationship between 
data, 

wherein upon compilation of the computer program, the 
programming language compiler generates a compile- 
time error message for the compilation of the data 
verification construct when the value of the data- 
dependency expression is in the second set of pre- 
defined values, and no compile-time error message for 
compilation of the data verification construct when the 
value of the data-dependency expression is in the first 
set of predefined values. 

5. The compiler extension of claim 4 in which the 
compilation of the data verification construct generates no 
run-time code. 

6. The compiler extension of claim 4 in which the 
compilation of the data verification construct results in no 
allocation of run-time data storage. 

7. The compiler extension of claim 4 in which the 
programming language compiler supports programmer- 
defined data typing and in which the data verification 
construct utilizes the programming language compiler data 
typing to cause the programming language compiler to 
generate the error message. 

8. The compiler extension of claim 7 in which the 
compiler extension is contained in a macro library for a C++ 
programming language compiler. 



20 
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9. A method for verifying programmer-defined data 
dependencies at compile-time, comprising the steps of 

a) defining a check statement in a computer program, the 
check statement having an argument, 

b) inserting the check statement in the computer program 
at points where a data structure having a programmer- 
defined data dependency is accessed by the computer 
program, 

c) passing as an argument to check statement, a data- 
dependency expression having a one of a first set of 
predefined values when the data structure conforms to 
the programmer-defined data -dependency and having a 
one of a second set of predefined values different from 
the first set of predefined values when the data structure 
does not conform to the programmer-defined data- 
dependency, 

the check statement having a definition such that the 
check statement will compile successfully where the 
value of the data-dependency expression is in the first 
set of predefined values, and unsuccessfully where the 
value of the data-dependency expression is in the 
second set of predefined values. 

10. The method of claim 9 in which the successful 
compilation of the check-statement will result in minimal 
additional run-time overhead. 

11. The method of claim 9 in which the successful 
compilation of the check-statement will result in the gen- 
eration of no additional run-time code. 

12. The method of claim 9 in which the successful 
compilation of the check-statement will result in no addi- 
tional run-time data storage allocation. 

13. The method of claim 9 in which the check statement 
is a compile-time macro. 

14. The method of claim 13 in which the compile-time 
macro is a programmer-defined data type definition. 

15. A computer-readable memory for storing the instruc- 
tions for use in the execution in a computer of the method 
of claim 9. 
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